o 

(N 



O 



Testing the characteristics of a Levy process 



Markus ReiB 



■^ | Institute of Mathematics 

Humboldt- Universitat zu Berlin 
mreiss@mathematik.hu-berlin.de 

H 

ITl : April 5, 2013 

Abstract 

For n equidistant observations of a Levy process at time distance A„ we consider the 
^- , problem of testing hypotheses on the volatility, the jump measure and its Blumenthal- 

00 for both, the high-frequency regime A„ = — and the low-frequency regime A„ = 1 as well 

o 



as intermediate cases. The approach via empirical characteristic function unifies existing 
theory and sheds new light on diverse results. Particular emphasis is given to asymptotic 
C*~) ' separation rates which reveal the complexity of these basic, but surprisingly non-standard 

inference questions. 
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1 Introduction 

Jump processes have become a key model in stochastics over the recent years. On one hand 
this is due to an increasing need for modelling stochastic processes with jumps in areas ranging 
from physics and biology to finance and economics. On the other hand, there is a more and 
more profound understanding of the theory and properties of jump processes. While a general 
framework is certainly provided by semimartingale theory, Levy processes remain the basic 
building blocks. Many general results are derived by showing a good approximation by a 
Levy process, a semimartingale with constant and independent characteristics, on short time 
intervals. The statistics for jump processes and especially Levy processes has consequently 
attracted increasing attention over the last decade and is still a very active field of research. 
It has fascinating links to different fields of probability, statistics and analysis and is a source 
of important, suggestive and highly non-trivial problems. 

Here we concentrate on hypothesis testing based on observations of a Levy process (Xt , t ^ 
0). We assume throughout E[X^] < oo such that the Levy-Khinchine formula in terms of the 
characteristic triplet T = (cr 2 , b, v) can be described by 

K[e iuXA ] = exp(AV>(u)), ^{u) = ibu - ^-u 2 + / \e iux - 1 - iux)v(dx). 

We are interested in hypotheses on the volatility o 1 ^ 0, the drift 5sM and the Levy or jump 
measure v. In our observation model we dispose of equidistant observations Xq, X/±, . . . , X n /± 
at time distance A > 0. We shall adopt an asymptotic point of view and let n — > oo such 
that all quantities may be indexed by n. The distance A n may remain fixed which is up to a 
simple normalisation the so-called low-frequency setting A n = 1 or the observation time nA n 
may remain fixed which is up to a factor the high-frequency setting A n = — (also called in-fill 
asymptotics in econometrics) or A n may have an asymptotic behaviour inbetween, whence we 
always assume A n G [— , 1]- The different asymptotics of A n lead to very different statistical 
procedures and one purpose of the present work is to provide a unifying framework to reconcile 
these sometimes quite disjoint strands of literature. 

In the pure Levy case, where the characteristics are well understood in the characteristic 
exponent ip, it is natural to base the inference on the empirical characteristic function of the 
i.i.d. increments AjJX := XkA n — ^(it-i)A n (double usage of the symbol A is unfortunately 
standard): 

n *■ — ^ 



n 
fe=i 



Testing the characteristics of a Levy process 3 

We shall deal with complex-valued random variables Z{ and define Covc(^i, -^2) = E\Z\Z^ — 
E[Zi]E[Z 2 ], Varc(^i) = E[|Zi - E[Z X ]| 2 ]. With ip n (u) = E[e iuA k x ] standard calculations 
yield E[ip n (u)] = <p n (u), Covc(<Pn(u),<Pn(v)) = ^(<Pn(u - v) - ip n (u)ip n (-v)), Var c (^ n (n)) = 
k(l-\<Pn(u)\ 2 ). 

For the high-frequency setting inference is often based on truncating the increments A^X 
such that the diffusive part is separated from larger jumps. As we shall see, there are many 
parallels of these two approaches and the characteristic function ansatz provides also new 
insights in the case where we might approximate well a continuous-time observation statistics. 
On the other hand, as we shall see, a direct method of moments may be well suited even for 
the low-frequency setting. 

The test problems we consider are grouped according to the object of interest: the volatility 
in Section 2, the degree of jump activity in Section 3 and the global jump measure in Section 
4. Inference on the drift b is straightforward since in our definition E[A^X] = A n b and 
a natural and in many senses efficient estimator is given by b n = {n/S. n )~ l X n ^ n . When 
testing for instance the volatility a 2 we want to be as agnostic as possible with respect to 
the jump measure and the drift which, of course, confound our observations. In fact, we are 
facing nonparametric testing problems in the sense of Ingster and Suslina |15j : we consider 
on an infinite-dimensional class O a finite or infinite dimensional hypothesis set ©o and 
set 0i (r n ) = {$ E I dist(i?, ©o) ^ r n } in some metric with r n > and aim at testing 
Hq : 'd E @o against the local nonparametric alternatives H\(r n ) : •& E ©i(r n ). The aim is to 
provide a test of uniform level a E (0, 1), at least asymptotically, which has asymptotically 
power 1 uniformly over the alternative H\{r n ) for a separation rate r n — > as fast as possible. 
Since testing and estimation are strongly related, the best separation rate is often (mainly 
for testing functionals) equal to the best estimation rate or can be translated to it. We shall 
focus here mainly on these rates which quantify the complexity of the statistical problem and 
only touch upon asymptotically exact critical values. Deriving the latter becomes much more 
technical, e.g. requiring uniform central limit theorems under Hq, or degenerate since often 
an imposed bias condition on the nuisance part dominates the test statistics asymptotically. 
Adaptive nonparametric testing is of high importance in practice, but beyond the scope of 
the current work. 

To the best of our knowledge a general approach for testing the quantities in the character- 
istic triplet of a Levy process has not yet been pursued. On the other hand, very profound and 
interesting estimation, testing and confidence statements have been derived in the literature 
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for a variety of situations. We shall discuss the corresponding literature and their relationships 
in detail when presenting our results. These discussions also include open problems that seem 
to be of fundamental importance and show the frontiers of our knowledge on inference for 
jump processes, while encouraging hopefully further research in this area. 

Before treating specific testing problems let us give a very rough guide to what kind of 
inference is possible via the characteristic function approach. We use standard notation for 
asymptotic quantities like a n = 0(b n ) or a n < b n if a n ^ Cb n for some constant C > 
independent of n or X n = Op(b n ) if (X n /b n ) n ^i is stochastically bounded or tight. Only for 
heuristics we use a n ^> b n meaning a n is much larger than b n , mostly a n /b n — > oo. 

Inference on the characteristic triplet T = (cr 2 ,b, v) will be based on the empirical char- 
acteristic exponent ip n {u) : = A" 1 log((£> ra (u)), where the distinguished logarithm is taken, i.e. 
that branch of the complex logarithm which renders ip n continuous with V'n(O) =0. A lineari- 
sation yields ip n (u) — VK n ) ~ a (u) • ^ ne stochastic error or noise level in tp n {u) is thus 
of the order 

Var C (^M) _ V\<Pn(u)\- 2 - 1 

We can reliably distinguish between two characteristics (bj,aj,Uj), j = 0,1, at frequency 
u if the 'signal' (ipi — ipo){u) is significantly larger than the noise level in the observations 
generated by the characteristics with j = 0, say, i.e. 



i(ft-*)wi > >*',%;' - 1 . (1.D 



■"n 



We have for any characteristics |(/? n (u)| > e~ AnC u for some c > 0. Because of \ipQ^ n (u)\~ 2 = 
exp(2A n |Re(V'o(w))|) ~ l + 2A n |Re(V>o(«))| for \Re(i/jo(u))\A n — > the requirement (jl.ip thus 
imposes the restriction 

-== « \u\ < {^/2c n )- l J&n l \ogn. (1.2) 

VnA n v 

Let us suppose c n = c to be constant and then consider the two main frequency asymptotics: 

(a) (HF-estimation) A n = 1/n: the requirement is 1 <C \u\ < (v / 2c)~ 1 v / nTogn. 

(b) (LF-estimation) A n = 1: the requirement is n^ 1 ' 2 <C \u\ < (v / 2c) _1 y / Iogn. 

These heuristics fit well with the theory for these cases. In the high-frequency case we 
cannot identify, even for continuous-time observations, the jump measure away from zero 
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(there are only finitely many large jumps occurring), but the volatility and the index of 
jump activity around zero are identifiable and they determine the behaviour of <p n (u) for 
| u | —> oo. The upper bound for \u\ shows that the stochastic error prevents us from using 
too large frequencies and indicates roughly a y/n-iate that is at best attainable. In contrast, 
the low- frequency regime allows asymptotically to identify the entire triplet, but we might 
face logarithmic rates in n. If no Brownian motion is present, i.e. o = 0, the characteristic 
function decays less rapidly and faster convergence rates are possible. 

2 Tests on volatility 

2.1 General idea 

A first important problem is to test the hypothesis Hq : o 1 = <Tq for some fixed volatility 
<7o ^ 0, while the drift b and the jump measure v remain largely unspecified. When passing 
to equivalent measures on path space, the volatility remains invariant and a hypothesis on 
the volatility in finance is often obtained from data in a risk-neutral setting like option price 
data. 



The important observation now is that — ^-it 2 is the dominant term in the characteristic 
exponent ip(u) for large frequencies |u|. If we apply the general signal-to-noise ratio idea (|1.1|) 
to distinguish with some alternative volatility ci, the frequency U n > we choose should 
satisfy 

, 2 2nr2 U n e^lUl/2 

k? - °IPI » - T = V ■ 

If there were no nuisance induced by the jump measure, we would just optimise over U n and 



take U n = \/2An Gq , which yields the parametric n~ ^-separation rate for |<r 2 — Cq|. 

Genuinely, however, we have to take into account the impact of drift and jumps. Since 
the volatility only appears in He(ip(u)), the drift does not interfere when the test is based on 
the real part Re( , n (n)) = (ifj n (u) + ip n (—u))/2, but the jump measure does, of course. In the 
spirit of Jacod and Reifi [16] we classify the jump activity of v by a jump index (3 £ [0,2] of 
Blumenthal-Getoor type of intensity R > and consider 

i/ ,i/i £BG(f3,R) = iv l\2 1 -^\xf\Jx 2 )v{dx) ^r\. 

2 

In terms of the characteristic exponent this assumption implies Re(ipj(u) + ^u 2 ) G [— R\u\P , 0], 
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0, 1, because 



'5 - 1 -: 



Re ( / (e tux - 1 - im) Vj{dx)\ = I (cos(ux) - 1) i/j(rfx) 

2 2 /• 

(^- A 2) Vj(dx) > -2 / |«a;/2|^i/j-(ds). 



For the separation of the volatilities this additional bias induced by the jump activity thus 
requires 

\al-al\U 2 n ^RU^. 

For this part, the larger U n the better and considering bias and stochastic part together, we 
should optimize U n such that 

max ( U^e M ^(ffJ2 + RuS)) ,. 3 x m . n . 



o o I 1/2/ 9 Pi 

Asymptotically for large U n , oq > and nA n — > 00, this gives C/ n = Oq A n y log(nA„ ) 
and the separation condition 

Mog(nA„, M ) / 

For nA n < 1 the bias bound is asymptotically negligible and we choose U n = J 2 An Og" as 
in the parametric case. The critical line nA n =1 reduces in the high-frequency framework 
A n = - to the case /3 = 1, where it separates the standard parametric- type bounds for /3 ^ 1 
with the nonparametric bounds for (3 > 1 [16 . For the low- frequency setting or more generally 
A n > n -1 ' 2 the parametric type never appears. 

In the case gq = 0, the characteristic exponent under Hq changes and always yields 
the choice U n = (RA n )~ l 'P \og(y/n) which implies the faster separation rate o\ 3> 
R 2/3 A n (logn)P~ 2 . For any A n — > with polynomial rate in n there is a (3 > such 

that the separation rate is of super-efficient order o(n -1 ' 2 ), which seems counterintuitive. 
Yet, note that in the parametric case the testing problem faces the degenerate law N(0, 0) 
under Hq and thus the rate is dictated by the bias part. 

2.2 Test construction and properties 

Let us specify the first nonparametric testing problem more rigorously as 

H : T e 9 versus H\{r n ) ■ T G 61 (r n ) 
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with 

9 = {{alb, v) | 6 € R, v € BG(P, R)}, 
©i(rn) = {(<x 2 ,M I k 2 - o-ol > r„(7g,6 € R,i/ € BG{P,R)}, 
where <7q > and r ra J. is a separation rate that tends as fast as possible to zero while the 
two hypotheses can still be distinguished asymptotically. 

The above discussion was based on the characteristic exponent. For a concrete construc- 
tion of the test and limit theorems the characteristic function itself is more convenient. The 
variance result for (p n {u) implies by the inverse triangle inequality 

E[(|0 n («)| - Wn{u)\f] ^ E[|0„(«) " Mu)\ 2 ] < -• 

n 
For the characteristic functions cpo n in our hypothesis Hq we have 



This implies by Chebyshev inequality for any k > 1 the conservative, but uniform convergence 
bound 



limsup sup P T (\<p n (U n )\ (£ An >K ) ^ K 2 with A n<K 

n->oo Te8o 



-A„(^l£+flttf) _ JL e ~^Aul + K 



n \/n 

<5 



According to the preceding discussion, fix some small 5 > and put e = if A n ^ n and 
£ = 35 otherwise and then choose the frequency 



un=\ r- —sr n '— , M 



11 - £ )l og(nA^ /3 )V2 

CTo A n 

where the additional factor 1 — e will ensure uniformity over the bias part. We arrive at the 
acceptance interval 

A n , K = k+ a " - " A„ + -£=1 with A„ = K^)-* 1 -)/ 2 A e-\ a n = 2Ra^Ut 2 - 
L 01 0iJ 

(2.3) 

2.1 Proposition. For the testing problem specified in (|2.ip wi/j o~o,/3,R > and separation 
rate 

^ = ^ ( °8*-, . ) ,2 " J,/2 + * 

y/n \{l-e)log(nA 2 n ~P)\/2J s/n 

the test that accepts if \(p n (U n )\ £ A u ,k with frequency U n from (|2.2ft and acceptance inter- 
val A n ^ K from (|2.3|) has asymptotic level at most k~ 2 uniformly over the hypothesis Hq and 
asymptotic power 1 over the alternative Hi(r n ). 



r n = 2a n -\ — = ARa ( ^Zfl ) -\ y= for some p n ->■ oo 
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Proof. It remains to consider the asymptotics under the alternative. In the case e > we have 
A n ^ n - ' 1-5 " 2 , in the case A n ^ n we have A n ^ n -v 1 -\' 2 -p) °)/ 2 ^ whence from a n — > we 
infer y/rL4 n +an — > oo and the left endpoint of A UjK tends to infinity. Consider now a triplet 
from the alternative with a 2 ^ Og(l + r n ) such that its characteristic function satisfies 

lnn(^)Ke- A »^ = i^. 

In the case A n > n we have a n > A„ > n and thus ^fnA\^ rT ' a r n > n e ' 2_ ' ^ — >• oo since 
e/2 — 5 > 0. In the case A n < n~ s (e = 0) we find 

A„Vn(r„ - On) logpl" 1 ) 

> (nA 2 ^ V ^^^((nA-^- 2 )) 1 / 2 ^^ 2 ^) V 1)(^ 2 )/ 2 + p n )(log(nA 2 ^) V 1) 
~ (log(nA^))^ + (nA^)-^p n -> oo. 

Furthermore, note ^4^™ — > 1 for logr n < — logn (satisfied in case e = 0). Putting these 
asymptotics together, the distance to the acceptance interval, weighted by s/n, tends to 
infinity using x p — 1 Js plog(x) for x,p > 0: 

^{A\ +a - - Al +r -) > V^Al +r "(r n - a^logiA- 1 ) -► oo. 

A completely symmetric analysis for a triplet from the alternative with a\ ^ cr 2 (l — r„) yields 
the same order for the distance to the acceptance interval. Consequently, we have 

lim inf P T (\ip n (U n )\ £ A n , K ) = 1 

n^ooTG0i(r„) 

and the power of the test on the alternative Hi uniformly tends to one. □ 

For o"o = we change the hypotheses by setting 

G = {(0, b, v) | b € M, v G BG(J3, R)}, @i(r n ) = {(a 2 , b, v) \ a 2 ^ r n , b G K, v G BG{fi, R)} 

(2.4) 
and we consider 

U n = (,RA„)- 1 //3(log(^/(2 K ))) 1 ^, An, K = K 1/2 , oo). (2.5) 

2.2 Proposition. For the testing problem specified in (|2.4p with /3 > and any separation 
rate r n — > satisfying 

oo 



vloen/ 
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the test that accepts if \<p n {Un)\ £ •A nK with U n and Ann from (|2.5[) has asymptotic level k~ 2 
uniformly over the hypothesis Hq and asymptotic power 1 over the alternative H\(r n ). 

Proof. Under the hypothesis Hq we just use the bias bound 

and the same deviation bound (by Chebyshev's inequality) as before to attain the uniform 
level. Under the alternative we have 

and we conclude again by applying the uniform deviation bound. □ 

2.3 Discussion and extensions 

Due to the semi-parametric feature of the testing problem the asymptotic level of the test 
is given by the limit of the worst case error probability of the first kind over Hq (in this 
order!). If for each n only a single hypothesis is tested against a single alternative, then a 
parametric n -1 / 2 -separation rate can be achieved. This is discussed in detail by Ait-Sahalia 
and Jacod p] where even the exact LAN-property is derived and instead of Brownian motion 
also any stable process can be taken that dominates the remaining jump part. For their rather 
involved estimation procedure the authors then also prove a nonparametric n~ 1 ' 2 A A„ - 

rate, which is up to the loss of the logarithmic factor exactly our separation rate r n . As 
discussed in Jacod and Reifi [16J for the high-frequency case, this logarithmic gain is possible 
also for estimation, the estimator simply being 

a n := — 2~Re(log(<p n (U n ))) with our choice of U n . 

The rate-optimality of this estimator has been established and the lower bound proof directly 
delivers the lower bound for our separation rate in Proposition 12.11 Testing the hypothesis 
Hq : a 2 = has been studied by Ait-Sahalia and Jacod [2] in a much wider semi-martingale 
setting, but reduced to the Levy case their results are less precise than ours. It would be 
interesting to study the super-efficiency of rates o(n~ 1 ' 2 ) for less restrictive models than pure 
Levy processes, e.g. allowing for time-varying, but deterministic characteristics. 

Another interesting observation is that the Blumenthal-Getoor-type index /3 is only used 
to guarantee the order \U n \" for the bias term in the characteristic exponent. In the case 
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of finite intensity and a jump density (with respect to Lebesgue measure) u, the Fourier 
transform satisfies \J-v(JJ n )\ = 0(U~ S ) whenever v £ C S (R) with integrable derivatives of 
all orders. In that case the test statistics should be based upon f Q Re(ip n (uU n ))w(u)du with 
a weight function w satisfying f Q u 2 w(u)du = 2 (to recover a 2 ), L w(u)du = (to filter 
out A = z^(M)) and sup ne r 0j i] u~ s \w(u)\ < oo (to ensure J\uU n \~ s w(u)du = 0(U~ S )). This 
approach ensures that the bias is of order 0(U~ S ) while the stochastic error is not increased 
since only frequencies up to U n enter. Testing and estimation of a (in the case a > 0) 
is therefore possible with rate (t — i \ )( 2 + s )/ 2 v —t=. This is particularly interesting in the 
low-frequency regime A n = 1 with rate (logra)~' s+2 " 2 . In the related setting of estimation 
from financial option data the weight function approach has been successfully analysed and 
applied by Belomestny and Reifi [I]. Sohl [23] addresses testing and confidence regions for 
this situation in detail and Sohl and Trabs [22] apply this to financial data. Generalisations 
to other processes defined by their Fourier transform are possible, e.g. see Belomestny [6] for 
estimation in affine processes. 

An accurate choice of the critical value n to attain a certain asymptotic level should 
be based on a uniform central limit theorem. For empirical characteristic functions <fi n and 
corresponding characteristic functions (p n a central limit theorem for triangular schemes yields 

V^^n 1/2 (M^n(Un) ~ <p n (U n )),Im(0 n (U n ) - <f n (U n ))) -> iV R2 (0, J) 

with E n el 2x2 equal to 

1 / l + Re(^„(2^„))-2Re(^(C/ n )) 2 lm(<p n (2U n )) - 2Re(<p n (U n )) Im(<p n (U n ))\ 

2 \lm( Vn (2U n )) - 2Re((fi n (U n ))Jm((p n (U n )) 1 - Re(<p n (2U n )) - 2Im( Vn (U n )) 2 ) ' 

The form of S n can be easily checked using trigonometric identities, e.g. E[cos(uXfc) 2 ] = ^ (1 + 
E[cos(2-uA^.)]) for the variance of the real part. In the case nA n — > oo both ip n (U n ),ip n (2U n ) 
tend to zero whereas nA n — > implies ip n (U n ) — > e~ 1 , ip n (2U n ) — > e -4 (we omit the 
intermediate case). We apply the A-method for the mapping (x, y) \— > \Jx 2 + y 2 and conclude 
for our setting of testing under Hq : a 2 = <7q > 



Vn(\0 n (U n )\ - \<p n (Un)\) -> N R (0,v 2 ) with v 



1/2, if nAl'* 3 -^ oo, 

(1 - e~ 2 ) 2 /2, if nAl'P -»■ 0. 



To attain an asymptotic level a £ (0, 1) we should therefore choose k = vqi_ a / 2 with the 
(1 — a/2)-quantile of iV(0, 1). Remark that based upon \(p n (U n )\ the test is asymptotically 
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distribution free for the composite hypothesis Hq. Let us recall that uniformity in the central 
limit theorem over the null hypothesis is not established here. 

For general semi-martingale models our simple characteristic function approach does not 
allow inference on the integrated volatility L cr 2 dt, assuming here A = — for simplicity. 
The empirical characteristic function of the increments, however, still estimates the realized 
Laplace transform L e~ u Ut > 2 dt for stochastic volatility a 2 . Tauchen and Todorov [21] pro- 
vide a uniform (in u) central limit theorem for this case under jumps of bounded variation 
and discuss the potential of this Laplace transform approach for applications. For integrated 
volatility inference a direct extension of the present method is possible if the local charac- 
teristics of the semi-martingale vary smoothly and we localize the test statistics on small 
blocks. 

In the area of high-frequency financial statistics power variation methods are traditionally 
used, comparing discrete p-variations for continuous martingales and jump processes. It is 
quite striking that a slightly smaller choice of the frequency U n transforms our test statistics 
to a truncated quadratic variation statistics. If S n := U 2 A n — > 0, we have for any L 2 -Ito- 
semimartingale X with bounded characteristics 



$n{U n ) - 1 = ~ V\ 

n *-^ 

k=\ 

1 n i 

-(V(^nAj?X - f (ApC) 2 )l ^ v _ x ) + O0£/ 2 ) + -#{fc : |AjpT| > S^U- 1 } 



■ iU n A"X ( , ,il > 

n ^ e ^UA^X^S^U- 1 } ^ i {|Ajf.X>£ /4 l7n 1 }'' 

fe=l 



n\^' - • t {|Ajjx|<*i'*tf» 1 }v n 

fc=i 



iU - U 2 n 

H Yl A k X1 { \ A n X ^ S ^ U -l } - gT S( A * X ) 2l {[A]tX|<*y%- 1 } + °P( 5 n 2 + A nS- 1/2 U 2 ) 



fe=i fe=i 

'TT n TT^ n 

= ^ E A ^ X1 {\^X\^/ A ^} ~ 2^ E( A ^) 2l { |A^K« 5 yV- 1 } + °P( 6 n 2 )- 
fe=l fe=l 

Because of J2k=i( A k X ) 2 = Op(nA n ) this implies 

TI 2 n U 2 / n \2 

la.dy,. _ i _ ^ E(^) 2 i { ,A E x,«y v,r>, + #( E ^^Ww^,) + <*<«■ 

fc=i fc=i 

The second sum, which is not familiar for standard high-frequency variation estimation, is 
just an (asymptotically negligible) bias correction. Note that for inference purposes the size of 

1/2 

the approximation error is essential and the order Op(5 n ) = op(l) will often be sub-optimal 
and a more subtle argument than a second order Taylor expansion should be used. Still, the 
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derivation shows conceptually that we recover the truncated realized variance estimator when 

— 1/2 

we choose frequencies U n = o(A n ), which correspond exactly to standard truncation levels 
of larger order than n -1 ' 2 in the high-frequency setting, see e.g. Ait-Sahalia and Jacod pQ. 
Let us close this section by formulating two open problems that show our lack of fundamental 
understanding when passing from the pure Levy to the general semi-martingale setting. 

Problem 1. Prove or disprove that in the case of semi-martingales with A n = - and (3 = 1 
(bounded variation of the jump part) the integrated volatility L a^dt can be tested or estimated 
at the parametric rate n -1 ' 2 , as is the case for Levy processes or for semimartingales with 
P < 1. 

Problem 2. Assume that the observations are Y\~ = XkA„ + £k with noise variables ej~ ~ 
N(0, 1) i.i.d. independent from X. Find the optimal separation rates for testing based on these 
noisy observations. 

The first problem seems to be completely open and quite fundamental. The second problem 
is of high interest for financial statistics under microstructure noise assumptions and even for 
b = 0, v = the optimal rate is n -1 ' 4 in the case A n = - (Gloter and Jacod [13J). The 
combination of microstructure noise and nuisance by jumps for inference on the integrated 
volatility is a very active area of current research and definite answers even for the simple 
Levy case still lack. 



3 Testing indices of jump activity 

Another stream of recent research has concentrated on the estimation of the Blumenthal- 
Getoor index or other jump activity indices for Levy processes or more general semi- 
martingales. The rationale for infinite activity measures like the Blumenthal-Getoor index 
is again that these indices remain invariant when changing to equivalent measures and they 
can be recovered already in the asymptotics of a high-frequency setting on a bounded ob- 
servation interval. The basic idea is that these indices dictate the behaviour of ip n (U n ) for 
large frequencies U n . In contrast to inference on the volatility, however, two sources of nui- 
sance enter: the terms due to volatility and due to the general form of the jump measure. We 
concentrate on two prototypical testing problems in this context. 
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3.1 Testing the presence of jumps 

The easiest inference question is to test Hq : v = against the presence of a jump measure. 
We quantify the contribution of v in the alternative by H\ : J x^v^dx) G [r n ,oo] with <r 8 + 
f x s v(dx) ^ R 8 for some R > and otherwise arbitrary a 2 ^ and b £ R. In order to be able 
to separate the alternative even for finite jump activity z/(IR) < oo, we suppose nA n — > oo. 
At first sight this problem requires the characteristic function approach, but surprisingly it 
is solved very easily by a direct moment method. The formulation of the alternative is tailor- 
made for our purposes. For general characteristics the following curtosis equation follows 
directly from (log</2 n )' 4 )(0) = A n f x 4 u(dx) if that expression is finite: 

E[(A£X-E[A£X]) 4 ]-3E[(A£X-E[A£X]) 2 ] 2 = A n I x 4 u{dx) ^ 0. (3.1) 

Consequently, the corresponding empirical moments have a stochastic error of or- 
der Op(n _1 / 2 (A n f x 8 v(dx) + A 4 (<7 2 + J x 2 v(dx)) 2 )). The latter is always of order 
Op(R 4 n~ 1 ' 2 A n ), but note that the error level under the hypothesis for A n < 1 is much 
smaller than under the alternative. The test accepts Hq if the empirical moment in (|3.ip 
is within [— Ki2 4 A 2 , /tit! 4 A 2 ] for some fixed critical value k > 0. The separation rate is 
A~ 1 n _1 ' 2 A n in the sense that for r n (nA n ) 1 ' 2 — > oo it has asymptotic power 1 on the 
alternative. 

To see that the separation rate (nA n ) -1 ' 2 is minimax optimal, just consider the test 
between the two single hypotheses Hq :cj = 0,6 = 0, v = (process is constant zero) and 
H\ : a = 0, b = 0, v = (nA n )~ 1 6( n/ ±\i/a ((nAn) 1 ' 8 times a Poisson process of intensity 
(nA n ) _1 ). Then with probability e~ 1 under the alternative the process remains equal to 
zero on [0, nAJ and thus no test can distinguish between Hq and H\. The alternative jump 
measure obviously satisfies J x^v{dx) = (nA n ) -1 ' 2 as well as f x 8 v{dx) = 1. Any test of level 
less than e _1 therefore must have power less than 1 (both uniformly in n). 

The lower bound argument with point masses that shift to infinity also shows that the 
specification of the alternative has a great impact on the optimal solution. If the alternatives 
separate like u{M) G [r n ,oo] or J x 2 v{dx) G [r n , oo], they do not separate at all from Hq 
since infinitely divisible distributions whose Levy measures satisfy x 2 v n {dx) — > 5Q(dx) weakly 
converge to a normal distribution and thus Brownian motion cannot be separated from pure 
jump processes with f(x 2 A l)v(dx) = 1. Let us refer to Neumann and Reifl [TS] for a similar 
argument in total variation distance and the implication that (honest) testing between general 
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classes of pure jump processes and diffusion processes is not possible, even in a high-frequency 
setting. 

3.2 Testing an index of Blumenthal-Getoor type 

Informally we want to test the hypothesis Hq that the Blumenthal-Getoor index /3 of the 
jump part equals (3q G [0,2) against the alternative Hi that |/3 — (3q\ ^ r n for unspecified 
drift b and volatility a 2 ^ 0. For stable and tempered stable processes it is known that the 
Blumenthal-Getoor index exactly describes the growth behaviour \U n \^ for the jump part 
of the characteristic exponent as the frequency |£7 n | tends to infinity. Pragmatically, we just 
generalise further to all Levy processes such that the characteristic exponent for frequencies 
U n — > oo satisfies under Hq 



2 

Re(iP(U n )) = ~ U n ~ R\Unf° - r(U n ) with |r(«)| < R'(l + \u 



where we suppose that the remainder t(u) is for large frequencies of lower order, that is 
j3' < /3, cf. Belomestny [5J for a broader discussion of this assumption. The index /3q appears 
in the second-largest term and we have to filter out the quadratic term in the case a 2 > 0. 
As mentioned earlier, filtering can be achieved by considering a weighted mean over ip. To 
this end, we consider general finite weighting measures w on the Borel sets of [0, 1] with 
L u 2 w(du) = and W((3a) := L u^°w(du) > 0. The weighted characteristic exponent over 
[0, U n ] then satisfies 

f Re(ij(U n u))w(du) = -RW((3 )U^° - [ T(U n u)w(du). 
Jo Jo 

We have \f Q T(U n u)w(du)\ ^ R'(l + Un )\\w\\tv an d thus the term of interest dominates. 
Our test should be agnostic with respect to the additional parameter R. Belomestny [5] 
uses the natural trick to consider the logarithm in which case log(J* ~R.e(ip(U n u))w(du)) = 
/3o log(U n ) + 0(1) holds. For his low-frequency situation the only logarithmically smaller bias 
term does not harm, but for our general sampling scheme this is highly suboptimal. A better 
idea is to use two weighting measures wi,W2 with the properties of w above and with different 
values Wi(Po) / W 2 (Po) such that 

J 1 Re(^(^ n ^)) Wl (^) __ W^^ + OjR'R- 1 ^'-^) 
tiRe(iP(U n u))w 2 (du) ~ W 2 (f3 Q ) + 0(R'R- l U n H '- (3 °y 
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In terms of Re(V>(ti)) = A -1 log(|<p n (u)|) our test statistics is therefore 

J Re(i> n (U n u)) w 2 (du) 
with the quotient Q(f3) := Wi(P)/W 2 (P). For all indices (3 G [0,2) \ {/3 } we should have 
\Q(/3) — Q(Po)\ ^ c\/3 — /3q\ for some c > such that under the alternative the test separates 
well. A possible choice, inspired by Belomestny [5], is to take point measures 

with different 771, 772 6 (0, 1). Then Q(/3) = (??f — T] 2 )/(r)2 ~ ^l) satisfies the separation require- 
ment (its derivative is continuous and does not change sign). 

Consider the test problem Hq : T G 0o versus H\ :T G 0i, n for 

e = {reP(s) |Re(V'r('")) + 4 n2 + ^l M l /3o K^ jR ( 1 + l n l /3o ~ p )^>o}, (3.2) 

©i,n = {r G P(ff) I |Re(^r(u)) + ^-u 2 + R\uf\ ^ KR{1 + M*-'), \QiP) " Q(Po)\ >r n ,R> o} 

(3.3) 
within triplets satisfying a uniform maximal decay condition 



*2 



V{&) = {T|VuGlR: Re(^r(«)) G [-^-(1 +u 2 ),0]} 

and with some /3o G (0,2), a separation rate r n \, and ET, <r 2 > 0, p G (0, /3o]. The decay 
bound involving a 2 is used to bound the stochastic error uniformly. In practice and also in 
theory, it may be replaced by a minor overestimate of the true volatility. The values K and 
p are needed to bound uniformly the bias due to the second largest jump index. Regarding 
K note that a very small level R for the leading index /3 might be confounded by a high level 
R' of the second index (3 — p. We get the following first result where the bias bound for T n 
dominates the stochastic part. 

3.1 Proposition. Suppose nA n p ~ — > 00. Consider the test on (|3.2p and (|3.3p that rejects 



if 

\Tn\ > S^(< 7 n /3 ° + U-P{\\ Wl \\ TV + WMTvQiPo))) 

with U 2 = A" 1 ^ -2 log(raA„, p ~" }. Then the test has asymptotic levelO and for the separation 
rate r n — > such that 



A£ /2 (logn)-p/ 2 
the test attains on 0i n the asymptotic power 1. 



00 
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Proof. For 7~ £ 0o U 0i, n with general jump index f3 we obtain from the above arguments 
the uniform bias bound 



J Re(ip(U n u)) w\(du) 



J Re(^(?7ntt))«;2(rfw) 



Q(/5) 



^ 



A' 



W 2 (/3)+0(C/- p ) 



(t^ + U-»{\\ Wi \\tv + lk 2 ||TvQ(/3))). 



The stochastic error on the good event 

G n : = {Vu G [0, 1] : |(^ n (C/ n u)| ^ \\<p n (U n u)\} 
can be bounded via ip n — ip = log(l + {(p n — ip n )/(p n ) and Taylor expansion by 



(3.4) 



(3.5) 



i>n(U n u) ~ ^(U n u) 



<Pn(U n u) - <p n (U n u) 



A n (f n (U n u) 



^2 



A n \if n (U n u)\ 2 



This implies 



E 



o 



Re(t/j n (U n u) - ip(U n u))wj(du) 



uj„ 



^n 1 A n 2 ||^||^ y max \(p n (U n u)\ 2 (l + o(l)). 
«e[o,i] 



For our choice of U„ 



oo 



max \ Vn (U n u)\~ 2 ^ exp(A n a 2 (l + U 2 n )) < nA 2 ^"* 
«e[o,i] 



holds. We obtain the uniform bound under Hq 



J Re(i> n (U n u)) wi(du) J Q Re(ip(U n u))wi(du) 



J Re(ifj n (U n u)) w 2 (du) J Re(ip n (U n u)) wz(du) 



<u: 



-Po 



max 
3=1,2 



Re(4> n (U n u) - ip(U n u)) Wj(du) 



= P (U-^A-^ 2 A^-M/ 2 ) = o P (U-^ 

where the small op is due to A" 1 = o(t/ 2 ) for nA n p — > oo. Thus, the bias bound from fj3.4f) 
dominates, which defines up to an inflation by e the acceptance interval, and the level of the 
test tends to zero, provided P(Q n ) tends to one under Hq. Because of m.m u£ [ ^\ip n (U n u)\ ^ 
n~ 1 ' 2+£ ' for large enough n, which is of larger order than y/(\ogn)/n, we infer P(G n ) — > 1 
under Hq from the uniform deviation bound for the empirical characteristic process, proved 
in Theorem 4.1 of [19j using bracketing entropy. 

The power result is directly obtained from the corresponding bias bound under the sepa- 
ration rate since the same stochastic error bound shows that the bias part is dominating. □ 
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In the low-frequency setting A n = 1 the rate r n is exactly that of Theorem 6.7 in 
Belomestny [5], which is proved there to be minimax optimal. In general, the condition 
nA„ p ~' — > oo seems artificial, however. In the high-frequency regime A n = - it requires 
A) — P > 1] which in particular excludes jump indices of order less than one. On the other 
hand, we know from Ait-Sahalia and Jacod [3J that also (3q ^ 1 can be estimated and thus 
tested. The fundamental reason for our suboptimal result in this situation is that we have 
not used the special properties of the empirical characteristic function under a dominating 
Brownian motion part. 

In fact, instead of allowing for positive volatility exactly the same approach could also allow 
for a dominating a-stable process with a > (3q, when the weights satisfy the filtering property 
L u a w(du) = 0. Since the characteristic function then decays more slowly, we choose U n of 
order (A" 1 log(nA n p j) 1 '", assuming nA n p — > oo, and obtain the separation 

rate (A n /logn) p ' a . For A n = - we thus impose /?o > (1 + p)&/2, which requires /?o > a/2. 
The latter is now a very natural condition because it is well known (e.g., Sato [21]) that for 
/?o < ct/2 the laws of a (tempered) a-stable and of the sum of a (tempered) a-stable and a 
(tempered) /3o-stable process on the path space D([0, 1]) are equivalent, which statistically 
means that these two process models cannot be completely separated by a test even based on 
continuous-time observations on [0,1]. 

The distinction between a Brownian motion and an a-stable process with a close to 2 
seems statistically not significant after all we have seen so far, but the opposite is the case. 
The difference does not concern the 'signal' in the characteristic function, but the factorisation 
of the covariance function of the empirical characteristic function. We have 

Covc(Mu)Mv)) = 1 / (A n (a'uv+R(\uf+\vf-\ U -vf)+T(u)+r(v)-r(u-v)))-l) . 

(Pn{u)ipn{-V) n\ \ J J 

— 1/2 

Expanding the exponential this evaluates for \u\, \v\ — > oo with \u\ + \v\ = o(A n ) to 

1 2m- 1 / . 2 \k 

\ (0(A n (\uf + \vf))+Y: ( ^ + 0(A^M 2m )) . (3.6) 

fc=l 

— 1/2 

Let us profit from the product structure in u and v of the summands. For U n = o(A n ) and 
finite weight measures Wj, j = 1, 2, on the Borel sets of [—1, 1] satisfying 



Wj(-B) = Wj(B), / u 2p Wj{du) = for p 
Jo 



l,...,m-l (3.7) 



18 



Markus ReiB 



we obtain on the good event Q n from (|3.5p the estimate 



E 



Re(ift n (U n u) — ip(U n u))w(u)du 



o 
< E 



l Gn 



1 ^-i r <Pn(UnU) - (p n (U n u) Q / / ip n (U n u) - fn{UnU) \* , _ ^. 



ip n (U n u) ' " VV ip n (U n u) 

A^O^AnUi + n- l (A n U 2 n f m + n- 2 A n [/ 2 ). 



For the last bound we have inserted the sum from (|3.6[) whose integral vanishes by the proper- 
ties ([32D of to, the order U n = o(A; 1/2 ) and the bound E[|/(- • • ) 2 dw\ 2 ] = 0(max M E[(- • • ) 4 ]) 
by Jensen's inequality. As in the preceding proof, we have P{Q n ) —> 1 and thus uniformly 
under Hq 

T n = 0(U-P) + P ({nA n )- l l 2 U-^{Ut /2 + A^ l / 2 U 2 n m + n" 1 ^)) 
= 0(U- p ) + Op{{nA n )- x ' 2 {U-M 2 + A™- 1/2 U 2 ( m -P°^)). 

Let us suppose p < /3q/2 such that U n ° = o(Un P ) and by balancing the first and last rate 
we choose the maximal frequency 



JJ = n l/(4m+2p~2/3o) A -(m-l)/(2 m +p-/3 ) > 



(3.8) 



Exactly as before we thus derive the following result for the same test, but at a different 
frequency and with moment conditions on the weights. 

3.2 Proposition. Suppose p < /3q/2 and that w\, w<i satisfy (|3.7|) for some m ^ 2. Consider 
the test on (|3.2p and (j3.3|) that rejects if 

K + e 



-^ 77 ^ 



-{U-P + l7-"(||«;i||rv + I|w2||tvQ(A))))- 



VWq) 

with U n from (j3.8|) and any e G (0,1). Then the test has asymptotic level and for the 
separation rate r n — > such that 



Un 



p/(4m+2p-2 / 3 )A" (m ~ 1)P//(2m+P " /3o) 



X 



n 



the test attains on 0i ;n the asymptotic power 1. 

For the high-frequency case A n = — the separation rate is n -p( m - 1 / 2 )/( 2m +P~l 3 o) ^ which 
for m = 2 (wj only filter out u 2 , which we need anyway) yields n -3p '' 8+2p-2 "°) while for 
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m — > oo the separation rate approaches n~ p > 2 . Note that by the Weierstrafi approximation 
theorem no weight w can satisfy (|3.T|) for all m, but for all finite values of m such weights 
exist. Given the bound p < fio/2, we can thus almost attain the rate n - ^ ' 4 provided the bias 
due to the nuisance part only grows with exponent p = p\/2 in Re(?/>) and the weights have 
a high number m of vanishing moments. 

This rate consideration is extremely interesting because even in the simple parametric 
model where the sum of a Brownian motion and a /3o-stable process is observed, the optimal 
rate is n~^°' 4 (log n)^ ' 4-1 (Ait-Sahalia and Jacod [3]). The authors provide a procedure which 
for very general semi-martingales estimates a generalized jump index f3o with a rate that is 
significantly slower unless /3q is close to zero, compare the discussion in their Section 6.2. 
The procedure is based on counting increments that are larger than for the pure Brownian 
case, their truncation value u n plays a role as the inverse frequency U~ l in our approach. In 
a second step the authors consider also the estimation of secondary and further successive 
Blumenthal-Getoor-type indices which suffers, however, from even slower rates of convergence. 

Another kind of jump intensity measure has been considered by Trabs [25] for pure jump 
self-decomposable processes where the density of the Levy measure behaves like v(x) ~ a|a;| _1 
for x — >• for some a ^ 0, possibly different for x 1 and x \ 0. A well known example is 
furnished by Gamma processes and the value a determines many probabilistic properties 
of the self-decomposable processes. Estimating a is possible with polynomial rates, even for 
option price data, and the rate itself depends on a. 

In view of the preceding results and discussion let us conclude by asking the imminent 
question, for which there is no satisfactory understanding yet, leaving aside the even more 
challenging questions for general semi-martingales. 

Problem 3. Construct an estimator or a test for fto which achieves exactly the parametric 
rate for Po-stable processes. If this is not possible, provide a lower bound proof for that. 

4 Nonparametric test on the jump measure 

4.1 General approach 

Inference on the jump measure as a natural nonparametric object is another key topic of 
current research. If we are not interested in the realized jumps, but in their intensity, we have 
to assume an asymptotically infinite time horizon nA n — ► oo. Concerning testing problems 
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there are again a variety of interesting hypothesis formulations. We restrict ourselves to a 
natural form, based on the observation that small jumps and volatility interfere and from a 
probabilistic side the finite measure 

u a (dx) = x v(dx) + a 5o(dx), 

which combines the downweighted jump measure at zero with a point measure of size a 2 
at zero, determines the topology in the law of infinitely divisible distributions: if a sequence 
(va )m converges weakly to v a and the drifts satisfy b m — > b, then also the finite-dimensional 
distributions of the corresponding Levy processes converge, see e.g. Sato [21J or Neumann 
and ReiB [19J. The measure v a is also quite tractable as 

tf'(u) = -a 2 + f(-x 2 )e iux u(dx) = -Tv a {u) 

holds. We thus aim at testing the null hypothesis H q : v a = v a $ = x 2 vo(dx) + a^d^dx) for 
some given jump measure uq and volatility Oq. In classical nonparametrics, e.g. in regression, 
the local alternative would be formulated by postulating an L 2 -distance of rate r n and a given 
smoothness class, cf. Ingster and Suslina [15]. Here in the case of finite measures, we shall 
also consider the case of point measure to encompass the volatility. 
Our test is based on the smoothed L 2 -statistics 

where K : R — > R is some kernel function, i.e. K S L 2 (R) with JK = 1, and 
Kh(x) := h~ 1 K(x/h) for some bandwidth h > 0. Moreover, we require K to be band- 
limited, more specifically supp(J-"i^) C [—1,1], such that the integral only extends over the 
interval [~h ,h ]. If the true measure is v a \ and if the empirical characteristic function 
is close to the true one, then by Plancherel's theorem T n measures basically the distance 
J(Kh * (i/g-,1 — v a fi))(x) 2 dx. When the bandwidth h tends to zero, the distance tends to 
infinity unless v a \ — u ao possesses an L 2 -density whose squared L 2 -norm is then obtained. 
As a side remark we can also view FKh{u) = FK(hu) as a weight function w(U^ u) with 
U n = h~ l and interpret the test statistics just as a frequency weighted average in the spirit 
of the preceding section. 

To analyse the properties of T n under Hq, we linearize the dependence of ip" on 93. We set 

___ (p n -tp n „ _ ((p n - <p n )" - 2((p n - (p n )'A n ip' - (tpn - (p n )(A n tp" - A^ili') 2 ) 
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and obtain as 5 n := maxj = o,i,2|5 (u)\ — > 

Vi \ i" I \ l 9"(u)(l+g(u))-g'(u) 2 l( rri&\\ 

V> n (u)-V ( u ) = ^ (l + q(u)) 2 n ^ ( u ) + °( Sn >r 

Assuming finite moments of order 8, i.e. f x s v(dx) < oo, and using 

Varc(^'V)) < n-H&n + l(j = 0)), E[\0^(u) - ¥#>(«) I 4 ] < n~ 2 (A n + l(j = 0)) 
for derivatives of order j = 0, 1, 2, we infer 
\4'M-1/'(u)\ 2 = Op( * ., (A n +Ay(n) 2 +Ay(u) 2 +A^(«) 4 +n-VnWr 2 )) 

xn/A n\ l Pn\U)\ / 

uniformly in u. Together with HT 7 ?^ 2 ^ = 0(h~ x ) and (^'("u)! i$ |t*|, \ip"(u)\ < 1 for \u\ — > oo 
this implies uniformly over Hq for h G (0, 1) 

T n = O p ( max - fe - (l + A 2 /^ 2 + A^~ 4 + ( n A n )^Vn,oWr 2 )) • (4-1) 

The choice of the bandwidth h is at our disposal and we shall discuss the low-frequency setting 
and the case A n — > separately. To remain concise, formal proofs are omitted. They follow 
along the lines of the preceding sections. 

4.2 Asymptotically small observation distance 

If A n — > we may restrict to h n ^ A n which yields max, i <ft -i|v?n,o('")|^ 2 = 0(1). The 
advantage is that the bound ([4.ip under Hq simplifies considerably: 



.nA r _ 

We can gain efficiency when ||zv,o — ^<t,i||h s ^ R, that is v a ^ — v a ^_ has a Lebesgue 
density which lies in an L 2 -Sobolev ball of regularity s > and radius R > 0. Then standard 
approximation theory gives 

\\K h * (zv,o - ^,i)||l2 ^ \\u ff)0 - v ffjl \\ L 2 - \\u afi - v a>1 - K h * (u afi - u ff>1 )\\ L 2 
= \Wfffl - v*,i\\l 2 - 0{Rh s ). 

Hence, we obtain for h n = (nA n ) _1 /' 2s+1 ' (assuming h n > A n , i.e. A n < n~ 2jS// ( 2s+3 )) and 
for ||^,o,n - ^(r,i,n|Ua ^ r n with r n (nA n ) s /( 2s+1 ) -)■ oo 

A/nA^Hif^ * (v<r,o,n ~ Va,l,n)\\tf > \fnKJT n {r n - 0(Rh s n )) -> oo. 
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Under Hq the statistics h n nA n T n is bounded in probability and any test that rejects if 
nA n h n T n > K a for suitable n a > has asymptotically uniform level a G (0,1)- Under 
H\ the above bias bound yields nA n h n T n — > oo in probability such that asymptotically the 
test has power 1. For uniform power over H\ we require J(x 2 V x s )v(dx) ^ R over all triplets 
in Hi to make the above linearisation and stochastic bound work uniformly (note the decay 
bound Re(V>(u)) > -u C7 (R)u 2 /2 > -Ru 2 /2). 

The rate (nA n )~ s ' ( 2s+1 ) is certainly the optimal minimax rate for estimation since the 
jumps away from zero follow a compound Poisson process and even continuous-time observa- 
tion on [0, nA n ] of a compound Poisson process with jump density in H S (M) does not allow 
for faster estimation rates (it corresponds to density estimation with nA n observations). Note 
that for testing with respect to separating hypotheses in L 2 the rates for idealised regression 
models can be improved to (nA n ) _s '" 5+1 ' 2 ' (Ingster and Suslina [E]), which in our setting 
could only be established if we can show Var(T n ) ^ /i(E[T n ]) 2 and if we can perform an 
asymptotic bias correction for E[T n ] ^ 0. 

A comprehensive theory for estimating the jump density under asymptotically small obser- 
vation distances has been achieved by Figueroa-Lopez [10J, using sieve or projection methods 
on intervals bounded away from zero, excluding the small jumps. For continuous-time observa- 
tions the jumps on [0, nA n ] are observed directly and for sufficiently small A n the increments 
can be used as a proxy for the jump sizes away from zero. The rate (nA„) _s '' 2s+1 ' is achieved, 
an adaptive method via model selection is proposed and a minimax lower bound is proved in 
detail. The crucial condition on the speed A n — > in this paper is more than exponential in 
n and a much finer probabilistic analysis was needed to relax this condition to A n = o^^ 1 ' 3 ) 
(Figueroa-Lopez and Houdre |llj). to be compared with A n = 0(n~ 2 '^ 2s+3 >) in our case. 
This study of the discretisation error reveals a marvelous interplay between statistics and 
probability theory. Inference based on the second derivative ip'^ of the empirical characteris- 
tic exponent has been exploited by Comte and Genon-Catalot [9j to develop also a sieve or 
projection method with rate (nA n )~ s > ( 2s + 3 ) for estimating the smooth density x 2 v{dx) on R 
in the case a = 0. Adaptivity is again achieved by model selection, using sample splitting in 
addition. Closer related to the testing results obtained here are the confidence statements by 
Figueroa-Lopez [12] , but they are again based on approximating continuous-time observation 
statistics. 

Let us point out that for the (weighted) Levy measures u ff> j separation in other topologies 
than L 2 (M) for their Lebesgue densities are natural. In particular, we can be interested in 
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the separation rate for different volatilities cr 2 , and o\ or for the more general point measure 
case r n := (fo-,0 — ^lXl^o}) > for some xq £ R. If we assume one or finitely many point 
measures in v a $ — v a \ and otherwise a smooth density with IP-norm uniformly bounded by 
R > 0, we obtain as h — > 

\\K h * K, - v ff ,i)\\& > r n h- 1 ' 2 - 0(Rh s ). 

In order to separate H\ from Hq we need r n /(nA n ) -1 ' 2 — > oo for the same specification 
as before, that is h n = (nA n ) _1 '' 2s+1 ' and A n = 0(n~ 2 >( 2s+3 ') (here even non-positive 
smoothness s G (—1/2,0] is possible). Instead of point measures we can also consider jump 
densities with singularities (poles or cusps) of the type \v ao — v a ^\\{x) ~ r n \x — xo| 1_/3 for 
x — > xq and some /3 < 2, e.g. stable-like behaviour with index f3 of v at xq = 0. Then 
\\Kh * (fo-,0 — ^o-,i)||l 2 ^ r n h^~ 2 ^'' 2 — 0(Rh s ) holds, assuming .£P-norm at most R away from 
xq and s > (3— /3)/2. The above analysis yields similarly for r n the rate (nA n )~( s+/3 ~ 3 ' 2 >> ( 2s+1 ) . 
The rates are better than for differences with smooth densities and for xq ^ also optimal 
for estimation. Yet, they are worse than those obtained by the specifically designed tests from 
the previous two sections on the volatility and on the jump index because they are given in 
terms of the observation time nA n instead of the observation number n. Maybe a tighter 
bound on the stochastic error is possible for xq = 0. 

4.3 Low-frequency observations 

For low-frequency observations A n = 1 we cannot follow the previous route. We must choose 
h n — > and thus accept (^ n (/i~ 1 ) — > exponentially fast in h~ , assuming a 2 > 0. Let us 
therefore choose h n = (log(n c )) -1 ' 2 for some small c > such that (|4.ip under Hq becomes 

T n = Opih-^-Hh- 4 + n^o- 1 )) = Op(n~ £ ) for c < (1 - e)/a 2 . 

In the smooth jump density case ||iv,o — iV.illi^ ^ .R it is essential that r n — 0{Rh s n ) ^ and 
we necessarily obtain a logarithmic separation rate (logn) _s ' 2 . For point measure alternatives 
such that \\Kh * (1^,0 — ^o-,i)IIl 2 ^ rnh" 1 ' 2 — 0(Rh s ), imposing a uniform bound a 2 on a 2 , we 
obtain with ca 2 < 1 the separation rate (logn)~' s+1 ' 2 " 2 . 

In all these cases the stochastic error is negligible compared to the bias if c > is chosen 
sufficiently small. This phenomenon is well known for so-called severely ill-posed statistical 
inverse problems like deconvolution with a Gaussian density. In fact, At/4(^) = l Pn( u ) I t( Pn(u) 
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implies in the bounded variation case (f_ 1 \x\i'(dx) < oo and a = 0) for the law Pa of X& 
xP/\(dx) = A((b — / xv(dx) )5o(dx) + xv{dx) J *P/\(dx). 

Since we have empirical access to Pa, the characteristics are thus identified by deconvolving 
xP/±(dx) with P^(dx). In classical nonparametric statistics this inversion is usually also per- 
formed by division in the Fourier domain. The main new features from a pure statistical point 
of view are that xP/±{dx) and P&{dx) depend on the same unknown law (the problem is also 
coined auto-deconvolution) , the empirical versions are highly correlated and the measure of 
interest xu{dx) has some Levy-specific structure. A similar, but slightly more involved con- 
volution equation is obtained for general Levy processes using ip'n(u). The plausible intuition 
for this connection to deconvolution is that at low frequency sums of (an unknown number 
of) jumps are observed in the increments, moreover diluted by a Gaussian component, and it 
is at first sight very remarkable that consistent inference is possible at all. 

The problem of estimating the jump distribution from observations of a compound Poisson 
process has come up early in insurance mathematics and is called decompounding. Inference 
on the distribution function of v has been first studied rigorously by Buchmann and Griibel 
[7] who invert the nonlinear equation Pa = exp*(Az^)e in terms of the convolution expo- 
nential exp* . A first natural extension is to consider Levy processes of finite intensity, hence 
superpositions of compound Poisson processes and Brownian motion. Here, the main features 
of the general Levy case already appear and the weight function approach for the empirical 
characteristic exponent, initiated in Belomestny and ReiB [1] for option price data, was suc- 
cessfully applied to low frequency data by Gugushvili p3]. In both cases, logarithmic rates 
are obtained in the case a > for all parameters in the characteristic and for the intensity 
A = i/(R). The optimal convergence rates become slower along the order a 2 ,b, \,v which is 
clear from the influence on the characteristic exponent for high frequencies. 

The problem of inference on v a for general Levy processes was addressed by Neumann 
and ReiB [19] and extended by Kappus and ReiB [T7], using a minimum-distance estimator for 
empirical characteristic functions. The problem has been studied more thoroughly and with a 
focus on adaptive estimation by Kappus [18], improving results by Comte and Genon-Catalot 
[8j, which brings in new perspectives and methodology on model selection with unknown 
heteroskedastic noise structures. Nickl and ReiB [20J then address the fundamental inference 
problem on the distribution-type function N of u, defined by N(x) = v((—oo,x]) for x < 
and N(x) = v({x, oo)) for x > 0. Given a slow polynomial decay of the characteristic function, 
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a uniform central limit theorem for a natural plug- in estimator N n {x) is provided, i.e. for the 
corresponding empirical process {y/n{N n (x) — N(x)), \x\ > £), ( > 0, very much in the spirit 
of the classical Donsker theorem for the empirical distribution function. It is surprising that 
this result demands a very fine analysis of the underlying deconvolution operator by means of 
harmonic analysis and Fourier-multiplier theory as well as the theory of smoothed empirical 
processes. 

Inference on the Levy measure has many parallels with inference on the distribution of an 
i.i.d. sample and already very natural questions inspire and stimulate research in other areas 
like the theory of jump processes and semi-martingales, adaptation techniques, empirical pro- 
cesses or Fourier-integral operators. This calls for further exploration, both from a theoretical 
and applied perspective. 

Problem 4. Solve the basic statistical inference questions also for the Levy measure like 
natural analogues of goodness- of- fit tests (Kolmogorov-Smirnov, Cramer-von Mises), semi- 
parametric efficiency, empirical process classes. Focus especially on the small jumps. 
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